Exact Expected Average Precision of the Random Baseline for System Evaluation

نویسنده

  • Yves Bestgen
چکیده

Average precision (AP) is one of the most widely used metrics in information retrieval and natural language processing research. It is usually thought that the expected AP of a system that ranks documents randomly is equal to the proportion of relevant documents in the collection. This paper shows that this value is only approximate, and provides a procedure for efficiently computing the exact value. An analysis of the difference between the approximate and the exact value shows that the discrepancy is large when the collection contains few documents, but becomes very small when it contains at least 600 documents.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deriving the Exact Cost Function for a Two-Level Inventory System with Information Sharing

In this paper we consider a two-level inventory system with one warehouse and one retailer with information exchange. Transportation times are constant and retailer faces independent Poisson demand. The retailer applies continuous review (R,Q)-policy. The supplier starts with m initial batches (of size Q), and places an order to an outside source immediately after the retailer’s inventory posit...

متن کامل

Exact and Efficient Generation of Geometric Random Variates and Random Graphs

The standard algorithm for fast generation of Erdős-Rényi random graphs only works in the Real RAM model. The critical point is the generation of geometric random variates Geo(p), for which there is no algorithm that is both exact and efficient in any bounded precision machine model. For a RAM model with word size w = Ω(log log(1/p)), we show that this is possible and present an exact algorithm...

متن کامل

شناسایی شاخص‌های کلیدی سنجش عملکرد افراد برای پرداخت پاداش

The current performance evaluation process are inspired by performance management within the organizations to take a step further in a way to the factors such as competency, merit, capacity for improvement and promotion are supposed to be taken into account the in addition to performance evaluation itself. Nowadays, the organizations prefer to set more accurate criteria for performance evaluat...

متن کامل

A Fast Baseline System for Large Scale Bird Identification

We present a description of our approach for the “Bird task Identification LifeCLEF 2015”. Our approach consists of a baseline system based on the classification of Mel-bands representations of bird singing using a random forest classification. This setting proved to be fast during testing, extraction of Melbands and classification was done in a couple of hours. Our best system reached a Mean A...

متن کامل

Reliable location-allocation model for congested systems under disruptions using accelerated Benders decomposition

This paper aims to propose a reliable location-allocation model where facilities are subject to the risk of disruptions. Since service facilities are expected to satisfy random and heavy demands, we model the congested situations in the system within a queuing framework which handles two sources of uncertainty associated with demand and service. To insure the service quality, a minimum limit re...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015